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Abstract — The performance of multiuser systems is both diffi- 
cult to measure fairly and to optimize. Most resource allocation 
problems are non-convex and NP-hard, even under simplifying 
assumptions such as perfect channel knowledge, homogeneous 
channel properties among users, and simple power constraints. 
We establish a general optimization framework that system- 
atically solves these problems to global optimality. The pro- 
posed branch-reduce-and-bound (BRB) algorithm handles general 
multicell downlink systems with single-antenna users, multi- 
antenna transmitters, arbitrary quadratic power constraints, 
and robustness to channel uncertainty. A robust fairness -profile 
optimization (RFO) problem is solved at each iteration, which 
is a quasi-convex problem and a novel generalization of max- 
min fairness. The BRB algorithm is computationally costly, but 
it shows better convergence than the previously proposed outer 
polyblock approximation algorithm. Our framework is suitable 
for computing benchmarks in general multicell systems with or 
without channel uncertainty. We illustrate this by deriving and 
evaluating a zero-forcing solution to the general problem. 

Index Terms — Branch-reduce-and-bound, dynamic coopera- 
tion clusters, fairness-profile. Network MIMO, optimal resource 
allocation, performance region, worst-case robustness. 



I. Introduction 

RESOURCE allocation is generally very difficult in mul- 
tiantenna systems. First of all, it is non-obvious how to 
measure multiuser system performance. In information theory, 
the sum capacity provides the highest reliable data throughput 
|[T], regardless of the computational complexity and delay 
resilience required for implementation. Signal processing mea- 
sures such as the mean squared error (MSE) are, on the 
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Other hand, only vaguely connected to the user-experienced 
service quality Q. Secondly, multiuser systems are limited 
by interference, requiring considerations between optimizing 
total performance and guaranteeing individual user service. 
Cellular users are often highly heterogeneous, both in average 
channel gain and delay sensitivity f3], making it tricky to 
even define fairness among users. Thirdly, simplifying assump- 
tions on channel state information (CSI), power constraints, 
synchronization, and performance measures are required to 
achieve tractable mathematical problems. 

A key to efficient performance optimization is to formu- 
late it as a convex problem, making the global solution 
achievable through efficient algorithms |4|. Convex formula- 
tions for downlink transmission were developed in Q and 
gradually extended in |6|-|8| to general power constraints 
and multicell conditions. Efficient algorithms, based on fixed 
point iterations, were developed in (|9), pO| . The convexity 
was achieved by assuming perfect CSI and pre-defined user 
performance constraints, thus ignoring how to select these 
optimally. The extension to maximizing the worst performance 
among all users is achieved by solving a series of these 
convex problems ||7|-p0l. The requirement of perfect CSI can 
also be relaxed using robust optimization techniques pTj . By 
assuming ellipsoidal uncertainty regions, convex formulation 
to the aforementioned problems can be achieved under worst- 



case robustness 1 12|-p6). In particular, p4|-| 16 1 discuss such 
robustness in a few special multicell scenarios. Robustness 
can also be defined probabilistically (i.e., with outage prob- 
abilities), but (conservative) bounds and approximations are 
required to achieve convex formulations in these cases [17|- 

Based on the references above, the multicell resource al- 
location can be solved in polynomial time either under fixed 
user performance constraints or if the goal is to maximize the 
worst (i.e., max-min) performance among users. Under general 
system performance measures, the global solution cannot 
be achieved efficiently; pO| shows that sum performance, 
proportional fairness, and harmonic mean optimizations are all 
NP-hard problems. However, such problems can still be solved 
with global convergence and optimality using the framework 
of monotonic optimization, developed in |2Tj , p2| . The outer 
polyblock approximation is an algorithm in this framework 
[21], and applications to single-cell |23|, ||24| and multi-cell 
transmission |25|-|j27j with perfect CSI have appeared in 
literature. Unfortunately, the polyblock algorithm requires a 
very large number of iterations to achieve accurate results, 
thus limiting usage to systems with no more than a handful 
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of users 0221 



A. Notation 



In this paper, we propose a robust monotonic optimization 
framework for general multicell scenarios with imperfect CSI. 
The framework can be appHed for any system performance 
measure that increases monotonic ally in the performance of 
each user, which of course is satisfied by all reasonable mea- 
sures. Convergence to the global optimum is achieved through 
a branch-reduce-and-bound (BRB) algorithm that builds upon 
previous work in f22], and we show far better convergence 
than the polyblock algorithm in 1 24 |-p7) . Each iteration of 



the BRB algorithm solves a quasi-convex subproblem. It is 
called a robust fairness-profile optimization, meaning that each 
user has a constraint on the lowest acceptable performance 
level and attains a predefined percentage of all performance 
above these levels. We show how to formulate this problem 
efficiently under worst-case robustness, extending results in 
p8), p9| for perfect CSI. Observe that the BRB algorithm 



solves a high-complexity (NP-hard) problem and is therefore 
mainly useful as a benchmark in system level evaluations of 
suboptimal low-complexity algorithms, although good lower 
bounds on the optimal solution is achieved in a few iterations. 

The structure and contributions of the paper are: 

• The general multicell system model of |j6), pO) with 
dynamic cooperation clusters is introduced in Section [H] 
User performance is measured by arbitrary monotonic 
functions of the worst-case MSE and system performance 
is an arbitrary monotonic function of each user's perfor- 
mance. The concept of a robust performance region is 
defined and important properties are proved. 



In Section III robust fairness-profile optimization (RFO) 
is introduced as a novel extension to standard max- 
min performance optimization problems. This problem 
is shown to be quasi-convex under worst-case robustness 
and a simple solution algorithm is given. 



In Section IV a novel framework for solving general 
robust monotonic optimization problems is proposed. 
Convergence to the global optimum of this NP-hard prob- 
lem is achieved by a branch-reduce-and-bound (BRB) 
algorithm over the robust performance region. 
To find initial performance bounds and illustrate the 
benchmarking capability of the BRB algorithm. Section 
|V] derives an approximation of the general optimization 
problem. By adding interference constraints and pretend- 
ing to have perfect CSI, a convex formulation is achieved. 
The proposed framework is evaluated numerically in Sec- 
tion fVI The robust performance region is illustrated and 



the strategy of Section [V] compared with the global op- 
timum. The computational complexity of RFO is shown 
to be manageable and the BRB algorithm shows better 
convergence than the polyblock algorithm in p6], p7). 



We have previously applied this framework to the different 
problem of robust coordinated beamforming with perfect in- 
tracell CSI and uncertain intercell CSI; see |31 1. The previous 
paper maximized functions of the signal-to-interference-and- 
noise ratios (SINRs), instead of functions of the MSEs. 



Boldface (lower case) is used for column vectors, x, and 
(upper case) for matrices, X. Let X-^, X^, and X* denote 
the transpose, the conjugate transpose, and the conjugate of 
X, respectively. For Hermitian square matrices X, Y, X )^ Y 
and X ^ Y means that X — Y is positive definite and semi- 



definite, respectively. 1^/ , Om € 



vMxM 



denote identity and 



zero matrices, respectively. The L^-norm of x is ||x||i. 1m G 
jjAfxi jg ^ vector with ones. The set of non-negative real 



n-dimensional vectors is denoted 



Element-wise (strict) 



inequality for vectors x, y is denoted x < y (x < y). 

II. System Model & Performance Measures 

We consider a multiple-input-single-output (MISO) system 
with Kf transmitting base stations and receiving users. The 
jth base station is denoted BSj and has antennas. The total 
number of transmit antennas is = J2j=i ^j- The fcth user 
is denoted MSk, has a single (effective) antenn^ and is a 
simple receiver: 

Definition 1. A simple receiver decodes its designated signal 

• As consisting of a scalar-coded data symbol Sk multiplied 
with a transmit beamforming vector v^: G C^^^; 

• While treating co-user interference as noise (i.e., without 
trying to decode and subtract interfering signals). 

Under these conditions, the transmission should obviously 
satisfy the first property. The use of transmit beamforming is 
actually optimal under single-user detection (i.e., the second 
property) if perfect CSI is available (|6), 



321, 1 33 1, while 



1 34 1 provides conditions on its optimality under channel un- 
certainty. From an information theoretic perspective, transmit 
beamforming and simple receivers are suboptimal p5) but 
these assumptions are of practical importance to achieve low- 
complexity receivers and power efficiency. 

In a general multicell scenario, some users are served in 
a coordinated manner by multiple transmitters. In addition, 
some transmitters and receivers are very far apart, making it 
impractical to estimate and separate the interference on these 
channels from the noise. To capture these properties, we apply 
the dynamic coordination framework of Q, |30J : 

Definition 2. Dynamic cooperation clusters means that BSj 

• Has channel estimates to receivers in Cj C {1,...^K^, 
while the interference generated to receivers k ^ Cj is 
treated as part of the background noise; 

• Serves the receivers in Vj C Cj with data. 

This coordination framework is characterized by the sets 
CjjVj, and the mnemonic rule is that T>j describes data from 
transmitter j while Cj describes coordination from transmitter 
j. To reduce backhaul signaling of data, the cardinality of Vj 
is typically smaller than that of Cj. These sets are illustrated in 
Fig. [T] and are selected based on long-term channel gains (see 
Q for details). To enable coordinated transmissions, perfect 
phase coherence and synchronous interference is assumed 



between transmitters that serve users jointly (see 1 36 1) 



' This model also applies to simple multi-antenna receivers that fix a receive 
beamformer (e.g., antenna selection) prior to transmission optimization. 
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Fig. 1. Schematic intersection between tiiree cells. BSj serves users in the 
inner circle CDj), while it coordinates interference to users in the outer circle 
(Cj). Ideally, negligible interference is caused to users outside both circles. 



The narrowband, flat-fading channel from BS, to MS^ is 



The combined channel from all transmitters is 
. The received signal 



denoted = [hf^. . . . hj^^hV G 



-<JVxl 



at MS I. is modeled as 



Kr 



Vk 



(1) 



k=l 



where the scalar-coded data symbol Sk for MSfe is assumed 
to be zero-mean and have unit-variance (without loss of 
generality). The block-diagonal matrix G (^nxn ^qIqq^^ 
the transmit antennas that send Sk and is defined as 



J diagonal block j 



if ke Vj, 
if Vj. 



(2) 



Observe that D^v^ is the effective beamforming vector, but 
we will optimize over v^. for notational convenience; any 
(reasonable) solution to the optimization problems herein will 
satisfy = DfeV^. 



Similarly, Cj. e 



^NxN 



selects signals from transmitters 



that have channel estimates to MS^ (i.e., those with non- 
negligible channels). This block-diagonal matrix is defined as 



[C 



A; J diagonal block j 



(3) 



if k e Cj , 
if k^ Cj. 

The noise and remaining (weak) interference are given by the 
circular-symmetric complex Gaussian term nj. G CJ\f{0, af,). 

The transmission (i.e., selection of beamforming vectors) is 
limited by L quadratic power constraints 

J2^kQi^k<qi (4) 

k=l 

where Q; e ^nxn ^j.g Hermitian positive semi-definite 
matrices for all /. To make sure that the power is constrained in 
all spatial dimensions, these matrices satisfy '^f'^i Q,i >~ On- 

A. Channel State Information and Robustness 

In practice, transmitters have uncertain CSI. The uncertainty 
originates from a variety of sources, including channel estima- 
tion, feedback quantization, hardware deficiencies, and delays 



in CSI acquisition on fading channels. It is common to assume 
an additive error model I111|-|[19) with 



(5) 



where hfe = [hf^ 



uT IT 
■ "-Ktkl 



-<Afxl 



is the uncertain CSI 
of the combined channel vector and e (j^JVxi 
the combined error vector This model can, for instance, be 
motivated by viewing channel estimation as the main source 
of uncertainty |j37 |j^ Observe that both the channel estimate 
and the error should be set to zero for all h.jk with k ^ Cj. 

The stochastic distribution for efe is unboundecj^ thus com- 
munication cannot be robust towards any error This is usually 
handled by only considering a subset of error vectors, the 
uncertainty set, that has high probability |TT]-fT9]. If this set 
is included in the resource allocation (i.e., optimization with 
acceptable outage probability), approximations are required to 
achieve tractable problem formulations iTTl-fTQl. Herein, we 
consider a fixed uncertainty set and maximize the worst-case 
performance over this set p4)-|[T6). This approach is conve- 
nient as it can provide convex problem formulations, but is 
often accused of giving conservative performance results | ,38J . 
However, this is the result of using ill- structured uncertainty 
sets and can be avoided by proper selection of these setsj^ 

For analytical convenience and motivated by channel es- 
timatioij^ |[l4)-|[T6), we concentrate on (compact) ellipsoidal 
channel uncertainty sets 



iYfc(hfc, Bfc) = |hfc : hfc = hfc 



(6) 



where e (^nxn (jgfjjjgs the shape of the ellipsoid. Since 
many uncertainty sources are independent between transmit- 
ters (e.g., estimation and quantization are done separately), 
Bfc is typically block-diagonal in multicell systems. However, 
the analysis herein is not limited to such B^,. Other types of 
compact uncertainty sets (including separate sets for each h.jk 



and probabilistic robustness) are discussed in Section III-C 

While Ui, . . . , Uk^ represent the CSI at the transmitter side, 
each MSfe is assumed to only have a local estimate of hfe. 
Thus, the receivers are unaware of co-user interference and 
precoding vectors, and are therefore assumed to be optimized 
by the transmitters and told how to process their received 
signals. Observe that the performance can be improved by, 
for example, estimating the optimal equalizers at each receiver 

^Under training-based MMSE channel estimation | 37|, the error takes the 
form of j5j. The stochastic error vector is £ CM(0, E^) under Rayleigh 
fading. If the channel from each base station to user k is estimated separately, 
then the estimation eiTor covariance matrix Efe becomes block-diagonal. 

^This holds for Rayleigh fading channels, while practical estimation errors 
of course are bounded but can be very large. 

*In the probabilistic approach, the guaranteed performance is maximized 
under a given outage probability. Using an optimal precoding solution, one 
can create a set U of all error vectors that gives exactly the optimal guaranteed 
performance (or better). If U is used as the uncertainty set in the worst-case 
approach, it will provide the same optimal precoding solution. 

'Continuing the estimation example in a previous footnote, recall that ej, £ 
CA/'(0, Efe). Thus, efe belongs with probability p to the ellipsoidal set {e^; : 
2e^Ejr"'^eft < Xp(2-'V)}, where Xp(") the p-percentile of the x^(")" 
distribution. If we limit the robustness to this set, the channel uncertainty 



is given by j6| using 



x|{2iV)„l/2 



To enforce higher or lower 



robustness to errors on channels from some base stations, one can use different 
weights on the diagonal blocks of Bj.. 
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based on the effective channels with preceding, but this 
requires additional training overhead that might be unavailable. 

B. Examples: Two Simple Multicell Scenarios 

The purpose of the above system model is to jointly describe 
and analyze a variety of multicell scenarios. Typical examples 
are ideal network MIMC[^ [39] (where all transmitters serve all 
users) and MISO interference channels p3) , pO) (with only 
one unique user per transmitter): 

1 ) Ideal Network MIMO: All transmitters serve and coor- 
dinate interference to all users, meaning that D/j ~ ~ In 
for all k. If a total power constraint is used, then L = 1 and 
Qi = lAf. If per-antenna constraint are used, then L — N and 
Qi is only non-zero at the kh diagonal element. If perfect CSI 
is available, then = 0^ and thus Uk = {h^} for all k. 

2) Two-user MISO Interference Channel: Let BSfe serve 
MSfc and coordinate interference to the other receiver Then, 



"1 



while Ci = C2 = I 



N- 



If each transmitter has its own total power constraint, then 
L = 2 and Qi ~ Di for I — 1, 2. If each transmitter estimates 
its channel independently, then a block-diagonal matrix — 
_^ is used to define the uncertainty sets 14 k- If channel 
estimation is the main source of uncertainty, then B^j is a 
scaled version of the estimation error variance for hj^ p7] . 
The scaling decides the amount of error that the system is 
robust to. 

C. User Performance 

The user performance is based on the MSB. MS^ uses an 
equalizing coefficient to achieve an estimate — rkUk 
of the transmitted data signal s^. Thus, the MSB in the data 
estimation at MSfc is MSE^ = E{|sfe — SfcP} and becomes 

MSB, = llrfehf C,[DiVi . . . D^^^v^^J - e^H^ + |r,|V2. 

nnise 



signal distortion 



■ interference 



(7) 



where denotes the fcth column of Ik,. ■ Bor MSB optimiza- 
tion, it suffices to consider real-valued > as any complex 
phase can be included in the beamforming vector vj. without 
affecting the MSB in (|7]l. A block diagram of the system model 
is shown in Big. |2] 

Since the MSB describes the average squared distance 
between and its estimate s^, it should be small. The range 
of reasonably MSB values is 



0<MSBfe<E{|sfe|2} = l 



(8) 



where the lower bound assumes negligible noise and interfer- 
ence, while the upper bound is the original signal variance. 

Herein, the performance of MS^ is measured by a con- 
tinuous function gk{MS>¥.k) of the MSB. Our convention is 

^Ideal multiuser coordination is commonly called network multiple-input 
multiple-output (MIMO), even in the case of single-antenna users. 

'We can always disregard the received signal by setting = and achieve 
MSEfc = 1, thus MSEfc > 1 is always suboptimal. 



Divi 



sk,. 




hf= 


hf- 










-1 



■IlK, 



■ Si 



Linear Precodhig Uncertain Channels Equalizers 

Fig. 2. Block diagram of the downlink multicell system. Linear preceding is 
applied to each data stream and Dfc decides which antennas that can transmit 
to user k. The channel uncertainty is modeled by additive errors e^, while 
Cf; removes negligible channels that are included in the additive noise Uk- 
User k applies the equalizing coefficient to estimate its data signal. 

that good performance means large positive values, thus gk{-) 
is a strictly decreasing function. Brom ([HJ, the function is 
bounded as 



= 9k{l) < 9k{MSEk) < gkiO) 



(9) 



where we assumed .gfc(l) — for notational convenience. 
Performance measures that can be expressed in this way are, 
for instance, bit error rate (BBR), data rate, SINR, and the 
MSB itself. If the equalizing coefficients rk are based on 
perfect CSI, there are simple expressions for these utilities 
pT|. CSI uncertainty makes it hard to derive closed-form 
expressions, but a simple relationship is given in f\A\ Lemma 
1]. 

The user performance is limited by the power constraints in 
(|4|, but also by co-user interference. The MSB in (|7]i improves 
if the interference is decreased, but this will degrade the MSBs 
for other users. Under worst-case robustness, this relationship 
is characterized by the robust performance region: 

Definition 3. The robust performance region TZ C M.^^'' is 



n = 



{(5i(MSEi),...,5k.(MSEkJ) : 

(vi,...,vkJ e V, rfe>OVfc} 



where the worst-case MSB is denoted 



MSBfc = min max MSB^, 1 
\hkeUk 

and V is the set of feasible transmit strategies: 
(vi, 



(10) 



(11) 



V 



^vfQiVfe V? L (12) 

k J 



This region describes the performance that can be guaran- 
teed to be simultaneously achieved by the users. The shape of 
the X^-dimensional region depends strongly on the effective 
channels, uncertainty sets, power constraints, and dynamic 
cooperation clusters. In general, it is a non-convex set, but 
it can be characterized as normal | |2T| : 

Definition 4. A set T C M" is called normal if for any point 
X € T, all x' e M" with x' < x also satisfy x' € T. 
Lemma 1. The robust performance region TZ with compact 
uncertainty sets Ui, . . . , Uk,. is a compact and normal set. 

Proof: The proof is given in Appendix [B] ■ 
This means that for any point x ^ TZ, all points that give 
weaker performance than x are also in TZ. This simplifies the 



*A function g : IR+ — > R is strictly decreasing if for any x,x' £ 
that X > x' it follows that g{x) < g{x'). 



such 
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search for points in TZ that yield good performance; they all 
lie on the upper boundary d~^Tl and this boundary is easy to 
identify since there are no holes in TZ. 

Definition 5. A point y is called an upper boundary point of 
a compact normal set T, if y G T while {y' S M" : y' > 
y} n T = 0. The set of all upper boundary points is called the 
upper boundary of T and is denoted 

To determine which point on d^TZ that is preferable, we 
need a system performance perspective. 

D. System Performance 

While the achievable user performance is represented by the 
multi-dimensional robust performance region TZ, the system 
performance is given by a function f : TZ ^ M. that takes a 
point in TZ as input and produces a scalar value. For a given 
point g = ((7i , . . . , ) E TZ, typical examples are 

. Sum performance: /(g) =J2k9k'-> 

• Proportional fairness: /(g) = rifcSfe^^ ' 
. Harmonic mean: f{g)^Kr{J2k9k^)~^' 

• Max-min fairness: /(g) = min^gj.. 

Weights can be included in these examples to compensate for 
heterogeneous channel conditions, delay constraints, etc. 

Herein, we assume that the system performance function 
/((7i(MSEi), . . . , or-,,(MSEa',,)) is Lipschitz continuous and 
strictly increasing This is satisfied by the aforementioned 
examples, and by all reasonable system performance measures. 
When combined with a performance region TZ that is compact 
and normal, we have the following important result. 

Lemma 2. If /(•) is a strictly increasing function and 7?. is a 
compact and normal set, the global optimum (if it exists) to 



maximize /(g) 



(13) 



is attained on d^TZ. In addition, for any g € d~^TZ there exists 
a strictly increasing /(•) with g as global optimum. 



Proof: The first statement is proved in |21 Proposition 



7]. The second statement is proved using the strictly increasing 
function /(g) = miufc gt/gk with g = (51, . . . , ffif J € d+TZ. 
Obviously, maXg^Tj /(g) > /(g) — 1 and assume for the 
purpose of contradiction that it exists g* E TZ that achieves 
strict inequality. This means that g* > g and thus g cannot be 
an upper boundary point since {y' G : y' > g} n 7?. 7^ 
(see Definition |5]). This contradiction yields maxggT^ /(g) = 
/(g) and thus g is the (non-unique) global optimum. ■ 
Based on this lemma, we only need to search the upper 
boundary of TZ to solve any system performance optimization 
problem. However, this is not as simple as it seems; |42| 
showed that sum performance maximization is NP-hard for 
any number of transmit antennas, while pO) showed NP- 
hardness for the harmonic mean and proportional fairness 
for Nj > 1. A main characteristic of NP-hard problems 
is that there are no known algorithms that solve them in 
polynomial time, and it is widely believed that there exist no 
such algorithms. 

'a function / : M" — > R is strictly increasing if for any x,x',x" G 
such that X > x' and X > x", it follows /(x) > /(x') and /(x) > /(x"). 



From pO) , | |42) it is fair to say that system performance 
optimization is generally NP-hard. However, there is a useful 
problem that can be solved efficiently (i.e., in polynomial 
time), namely the max-min fairness optimization (defined 
above) |20|. It belongs to a larger category of problems, robust 



III 



fairness-profile optimization, that we analyze in Section 
under channel uncertainty. It is also an essential subproblem 
of the BRB algorithm in Section|lV]that solves the monotonic 
optimization problem for any /(•), although the NP-hardness 
makes the convergence unsuitable for real-time applications. 

III. Robust Fairness-Profile Optimization 

In this section, we consider a particular /(•) for which ( [T3| ) 
can be solved efficiently and which is used as subproblem of 
the general BRB algorithm in the next section. The considered 
robust system performance optimization problem is 



maximize 
(vi,...,vit^)ev 

('-i,...,rKjeR+'- 



mm 

fc 



gfc(MSEfe) - flfc 



(14) 



subject to (7fc(MSEfe) > Ofe Vfc. 



This problem can be seen as a generalization of classic robust 
max-min optimization (see e.g., | ,14J ) where two fairness 
constraints have been added: 

1) Each user has a lowest acceptable level gk{MSEk) > a^; 

2) The total performance above this level is divided such 
that each user gets a predefined portion a/. > 0. 

The first constraint is represented by a = [ai, . . . , uk > 0. 
The second constraint is called a fairness-profil^j^ and is 
symbolized by a vector a 



[ai,. . . aK,.\ 



that satisfies 



J2k=i ^k — ^ (without loss of generality). 

We call ( |14| l a robust fairness-profile optimization (RFO) 
and observe that this problem has a simple geometrical inter- 
pretation; we start in a e 7?^ and follow a ray in the direction 
of a until a point on the upper boundary d^TZ is found p] 
In general search regions, the ray might leave the region and 
come back again which makes the search very complicated. 
Fortunately, TZ is a compact and normal set and thus the 
ray intersects the upper boundary in a unique point. This is 
illustrated in Fig. |3] where (a) and (c) are normal sets while 
(b) is non-normal and thus some rays from within the set cross 
the upper boundary multiple times. 

If we can find an upper bound /^pg"^ on the optimal value 
of ( [T4| i, we know geometrically that the optimum lies on the 
line-segment between a and a + q;/^pq'; see the illustration 
in Fig. [3] Hoping to simplify the RFO problem, we can thus 
rewrite p4ji as a bisection over this line-segment. 

Lemma 3. For compact uncertainty sets Ui, . . . ,Uk,^, fixed 
a, a and a given upper bound on the optimum of ( [T4] i, 

'"The terminology rate-profile has been used for similar problems in prior 
work |29| , |43| , |44| , but herein we extend these works by having arbitrary 
performance measures, uncertain CSI, and general multicell scenarios. 

"This geometrical approach finds an optimal solution to where 
(9ft(MSEf;)— aj;)/Q!j; Is the same for all MS^. In certain special cases (e.g., 
when the upper boundary is flat in some dimension), there also exist solutions 
where a few users get higher performance than this worst-user level. This 
discussed in 1451. 
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Fig. 3. Examples of robust performance regions with different sliapes. (a) 
Region is normal but non-convex, (b) Region is neither normal nor convex, (c) 
Region is both normal and convex. Simple bisection along a fairness-profile 
is not guaranteed to find the upper boundary of non-normal regions. 

the problem can be solved by bisection over the range F = 
[0, /rP^""]. For a given /^^'J^'''"'" G J", the feasibility problem 



find Vi , . . . , , f 1 > 0, , 
subject to MSEfc < 7^ V/c, 



E 

k 



(15) 



is solved for 7^ = 5fe ^(ofc +Q!fc/^o'''''"'). If the problem is 
feasible, all / e with / < /^p'J^"^^"' are removed. Otherwise, 
all f £ T with / > /^o'''''"' are removed. The initial 
feasibility of ([T4| is checked by solving ^ for /^"p'o"'''"" 0- 
Proof: From Lemma [T] 7?. is a compact and normal set. 
For such sets, a ray from a point within the region (in a positive 
direction) meets d^TZ in a unique point (see 1 ,21, Proposition 
6]). This point is the optimum to ([T4|, since the optimum must 
be on d^TZ (as proved in Lemma l2ji. As the ray only meets 
the upper boundary once, it can be divided into two parts: one 
part is inside of TZ and one part is outside. The intersection can 
be found (to any accuracy) by a line search (e.g., bisection) 
that iteratively checks if a point a+ct/^p'J^"^^"' is inside TZ by 
solving ( fTSj ). ■ 
Obviously, the RFO problem in ([T4]i is infeasible if a is 
outside of TZ, which can be checked as described in the lemma. 
A successful bisection also requires an initial selection of 
/rfo' i'^ Lemma [3] such that a+a/^pQ^ is outside TZ. If not 
given in advance, /^po"^ can be achieved in different ways: 

• /rpo' = Kr sup i^{gk{0) — ah). Cannot be used if 
supfcg/c(0) = 00. 

• /^o' = Efe.9fe(o-fc/('«fc||Dfhfc||^ + cr2))-afc, where Hk 
is a bound on the transmit power and is calculated as the 



smallest positive eigenvalue of ^'i among all /. 



f^o = Ek9k{MSE,..k) 



g,tr(Dfc) ^ _ 

flfc, where MSEsu,a: is the 
optimal robust MSE if MS^ is the only active user 
The first one is the simplest and ignores the power constraints, 
while the second one ignores co-user interference and uncer- 



tainty and assumes that the highest power available in some 
spatial direction can be used in any direction. The third one 
takes the MSEs achieved in a single-user system and requires 
that these problems are solved (which is simple under some 
power constraints), but achieves the tightest value on /^o^ 

A. Convexity of Feasibility Subproblems 

Solving the RFO problem using bisection, as suggested in 
Lemma [3] is appealing as the range is halved in each iteration; 
thus, the number of iterations scales only logarithmical with 
the desired accuracy 5 of the solution, also known as linear 
convergence. In other words, the computational complexity 
is typically not limited by the number of iterations but by 
the complexity of the feasibility problem ([TSj solved in each 
iteration. Next, we will see that ( fTS) can be solved efficiently. 

If the transmitters have perfect CSI (i.e., Uk ~ {hfc}), 
the feasibility problem in ( fTS] ) is convex |14| and can be 
efficiently solved (e.g., using general-purpose implementations 
of interior-point methods f?]); see Appendix |A] for further 
details. Under worst-case robustness to CSI uncertainty, the 
feasibility problem in ( [T5| ) seems difficult to solve since there 
are infinitely many MSE constraints (one for each hfe G Uk). 
Fortunately, the following theorem provides a reformulation 
into finitely many convex constraints, based upon well-known 
results from robust optimization |11|. 

Theorem \. For the compact uncertainty sets in (|6|, the 
feasibility problem in ( [T5| l is equivalent to the convex problem 

find vi , . . . , , ri > 0, . . . , fif^ > 0, 
Ai > 0, . . . , \k,. > 
subject to Afc >: 0]y+Kr+2 Vfc, (16) 



E 

k 



where V = [DiVi 

Afc = 



vf Q;Va; < qi \fl 



Da'.va',] and 



V 



IkTk- 





-fkek 



~ T 

-rkGk 



Ik-Tklx, 


-BfCfeV 











-V^CfB 













(17) 



Proof: The proof is given in Appendix [b] ■ 
This theorem only has one (linear) semi-definite constraint 
per user and has replaced the uncertainty set Uk by a variable 
Xk that indirectly represents the worst channel; if we can find 
Afc > that satisfies the constraint, then the original MSE 
constraints are satisfied for all ^Uk. 

Single-cell counterparts to Theorem [T] have recently been 
derived in |[T2)-p4), while the multicell generalization is 
novel. Special cases of the fairness-profile optimization prob- 
lem have also been considered before; if 5^ (MSEfc) = 
MSEjT^ — 1 and a = 0, the problem is equivalent to the 
minimization of the (weighted) worst MSE among all users 
Q, |[9|, IT3) , p4) . This special case can be posed as a 
generalized eigenvalue problem ||7), p4) , [ |46| , which can 
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improve the computational complexity. For general user utility 
functions gk{-), such simplification is not possible. 

The bisection algorithm for ( [l4| is summarized in Table |l] 

TABLE I 

Algorithm 1: Robust Fairness-Profile Optimization 



input starting-point a and fairness-profile ot 
input accuracy 5, f^pQ^ = 0, and (see suggestions) 

while f^"" - > S 

set /-"(f 



+/^r)/2 



lower ^candidate 

J RFO 



'RFO ^ JRFO 
set 7fe = 3fc ' (afe + Qfe/^™*''a.e) vfc 

if problem ifTsJ is feasible for these : set fg^pQ 
else: set f^= /-'^^w^'' end 
end 

return [fj^^^', /rfo"1 ^^^^ feasible solution to (B) from step 



B. Complexity of Feasibility Subproblems 

The previous section showed that the feasibility problem 
( [TSl l can be expressed as a convex problem under both perfect 
CSI and worst-case robustness. Thus, it can be solved with a 
computational complexity that is polynomial in the number of 
antennas N, users Kr, and power constraints L |47 Chapter 
6]. The exact complexity depends both on current systems 
conditions and the choice of solver algorithm (e.g., interior- 
point methods 1^4]), but we illustrate the complexity in Section 

Having CSI uncertainty will naturally increase the com- 
putational complexity, because Theorem [T] handles the in- 
finitely many MSB constraints by introducing extra variables 
Afe and because the resulting MSB constraints have larger 
dimension than under perfect CSI. In addition, the problem 
size under perfect CSI (and thereby the complexity) can be 
reduced by plugging in the optimal equalizing coefficients 
and rewriting the corresponding MSB expressions as second- 
order cone constraints |[7], |j8); see Appendix [A] Fixed point 
algorithms can provide fast solutions under perfect CSI with 
total power constraints flOl, but need to be combined with 
outer optimization procedures under general power constraints 
||8]. CSI uncertainty will, on the other hand, reduce the size 
of the performance region TZ, thus fewer feasibility problems 
need to be solved to attain a given accuracy 6. 

Finally, note that Lemma [5] solves \^og2ifs^^^ / feasi- 
bility problems to achieve the prescribed accuracy S on the 
solution to the RFO problem. This number is bounded by a 
constant, therefore the RFO problem also have polynomial 
complexity in the number of antennas N, users Kr, and power 
constraints L. This complexity is quite affordable, making the 
RFO problem a reasonable candidate for resource allocation 
in practical systems. To put it differently, the system designer 
basically has the choice between solving a RFO problem 
optimally or solving some other NP-hard resource allocation 
problem ( [T3| ) suboptimally. 

C. Extensions to the System Model 

The RFO approach is easily extended to any robustness 
scenario where the following properties are satisfied: 

• The performance region is compact and normal; 

• The feasibility problem in ( [T5| l can be solved efficiently. 



The first property is usually satisfied; observe that Lemma [T| 
proved it for any compact uncertainty sets (not only ellipsoidal 
sets). However, it is often difficult to solve the feasibility 
problem efficiently under general uncertainty sets. Tractability 
can be achieved through conservative approximations that give 
lower bounds on performance, see [13] , [14 J for examples 
with rectangular sets and intersections between ellipsoidal 
sets. If the worst-case robustness is replaced with probabilistic 
robustness constraints, conservative approximations of each 
user's performance are required to achieve tractable problem 
formulations |[T7)-|[T9). The probabilistic approach enables 
user performance functions based on the outage probability 
and outage data rate. Other possible extensions to the system 
model is Tomlinson-Harashima precoding (see |IT4l), soft- 
shaping constraints (see [[48J), and multi-carrier systems (see 
@). 



IV. Robust Monotonic Optimization 

In this section, we aim at solving the robust monotonic 
optimization problem in ( pj) for any system performance 
function. Recall from Lemma [2] that the optimum lies on 
the upper boundary. Hence, we can in principle look for an 
approximate solution in a large set of boundary points of TZ 



achieved by solving the RFO problem in Section III for a very 
fine grid of fairness-profiles a.. However, this naive approach 
has huge computational complexity, which calls for a more 
systematic approach that concentrates on the boundary in good 
directions. 

Next, we propose a branch-reduce-and-bound (BRB) algo- 
rithm for solving ( pj) systematically and with global conver- 
gence. It can be seen as an adaptation of the generic BRB 
algorithm in [22J to general multicell transmission. 

The algorithm maintains a set M with non-overlapping 
hyperrectangles that surely covers the parts of the robust 
performance region TZ where the optimal solutions lie (the 
solution might be non-unique). Iteratively, we split certain 
hyperrectangles and try to improve a lower bound /min and 
an upper bound /max on the optimal value of ( pj] i. To aid this 
process, a local upper bound I3{M) is also stored for each 
M. £ M. The algorithm proceeds until /max — /min < £, for 
a predefined solution accuracy e. 

In what follows, hyperrectangles are called boxes: 

Definition 6. For given a, b € M;^'' with a < b, the set of all 
X such that a < x < b is called a box and is denoted [a, b] . 

Initially, TV = {Mq} for a box Ma = [0, bo] C M^'"- where 
bo is based on some suitable upper bound that guarantees 
TZ C A^o (see suggestions in Section IIIi. The initial lower 



and upper bounds can be taken as /min ~ /(O) = and 
/max = /(bo), but some low-complexity resource allocation 
strategy can be used to obtain a better lower bound. 

Bach iteration of the BRB algorithm consists of three steps. 

1) Branching: Divide a box from JV into two new boxes. 

2) Reduction: Remove parts of these new boxes that cannot 
contain optimal solutions. 

3) Bounding: Search for a feasible solution in one of the 
new boxes and use it to improve /min, /max, and /?(•). 
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Upper Candidates 
m2r " ' ' 



mi 



Selected box Branching Reduction 



Bounding 



Fig. 4. An iteration of the BRB algorithm: A box is selected and branched 
into two new boxes. These are reduced based on the current bounds on the 
optimal value. Finally, line search between the lower and upper comers of 
the outmost box is applied to improve the bounds. 

These steps are illustrated in Fig. |4] and the details are 
explained in the next subsections. The final algorithm is given 
in Section |IV-D| For notational convenience, the fcth element 
of any vector x is denoted Xk- 

A. Branching 

Each iteration begins with selecting a box Aimax = 
] that contains the current upper bound /max: 



A^max = argmax 

MeAf 



(18) 



The intention is to improve the upper bound by partitioning 
■A^max into two new boxes TWi, A^2- New boxes of equal size 
are achieved by bisecting A^max along its longest side (see 
Fig. |4]). The index of this side is dim = argmaxj,(5max,fe — 
Omax k ) and the new boxes are 



Ml - [a; 
M2 = [a- 



max; bmax 



■S^dimJ ; 



(19) 



where s = (6max,dim - amax,dim)/2 and is the kth column 
of Ik^- The (local) upper bounds over these new boxes are 
also based on A^max^ 



(20) 



PiMl) = min(/3(7Wmax), /(bmax - SGdim)), 
/3(7W2)=/3(Xmax). 

Finally, the set A^max is replaced with A4i and A42 in J^- 
B. Reduction 

In this step, the new boxes A^j = [a;,b;], for ^ = 1,2, 
are reduced by cutting off parts that cannot achieve function 
values between the lower bound /min and (local) upper bound 
l3{M.i). If PiMi) < /mill, the whole box is removed from Af. 
Otherwise, it is replaced by a (potentially) smaller box [aj, bj] 
based on the following lemma. 

Lemma 4. If /min < P{Aii), all points g e [a;,b;] satisfying 
/min_ < /(g) < l^{Mi) are also contained in [a5,bj] C 
[a/,bi], where 



aj = b/ - ^ Vk(J)i,k - ai,k)ek (21) 
fc=i 

^'i=^'i+^Mhk-a'i^k)ek (22) 



with i/k and fik (for fc = 1, . . 

i^fc =max : 0<u <l, f{hi 



, Kr) calculated as 

- vik^k - ai^k)ek) > fn 



^fc=max|^ : 0<^<l,/(aj + n{bi^k ~ a'i^^)ek) < /3(A^;)}- 

(23) 

Proof: The proof is given in Appendix [B] ■ 
The reduction procedure in Lemma |4] is illustrated in Fig.|4] 

and observe that it needs to be implemented sequentially; first, 

the lower point a; is updated using ( 2T|i and then it is used to 

update the upper point b( using (|22i. 

Each reduction requires calculation of the parameters i^k , fJ-k 

in ( |23] l, generally solved by standard line search procedures. 

However, closed form expressions can be attained in many 

cases. For example, weighted sum performance with /(g) = 

J2k '^kgk (with weights Wk > 0) gives 



Ef^l Wkbi^k - fn 

WkCbi,k - ai^k) 



Vk 



fJ-k 



(24) 



-k=i ^ka^k 



1 



V Wk{hi,k - a'lj^) 
where the min-operator makes sure that VkT^k < 1- 

C. Bounding 

Each iteration ends with a bounding step where we search 
for feasible solutions in A42 = [a!,, by, which is the new box 
with the largest (local) upper bound (i.e., /3(A^2) > /3(A^i)). 
These solutions are used to improve /min, /max, and /3(A^2)- 

First, the feasibility of the lower corner is checked 
by solving ([TSj with "fk — 9k^{^'2k) ^^^^ problem 

is infeasible, then A^2 contains no feasible solutions and is 
removed from J\f. If the problem is feasible, the following 
lemma is used tojind lower and upper bounds on the feasible 
performance in A^2 by solving a single robust fairness-profile 
optimization problem (see Section |lll]): 

Lemma 5. Consider a box A4 = [a, b] C M:'|^'^ such that 
Ai n 7?, ^ 0. If 7?, is compact and normal, the highest feasible 
performance in Ai can be bounded as [/min,/max] for 



/min — /(a + Q:/rfo) 

/max = max/(b - {bk - nk)ek) 

k 



(25) 



fe=i 



where n = [tii, . . . , uk^] = a + a/^pcf , a = (b — a)/||b — 
a||i, and denotes the fcth column of I^-^. The variables 
/rfO' /rfcT '■^^ interval endpoints achieved by Algorithm 
1 with starting-point a, fairness-profile a, /^po' ~ ||b — a||i, 
and some given line-search accuracy S. 

Proof: The proof is given in Appendix |B] ■ 
The lemma is illustrated in Fig. |4j where a line-search is 
performed between the lower and upper corner of the box. The 
best feasible point on this line provides a local lower bound 
/min on the feasible performance. Since the region is normal, 
the outer points nik —^2~{b'2k~ ''^k)^k are candidates for 
giving a new upper bound on the feasible performance in the 
box. Observe that if the size of the box HbJ, — ayji is smaller 
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than the accuracy S, then no Hne-search is performed. This will 
not affect the convergence, as proved in the next subsection. 

The local lower bound /min from Lemma [s] replaces the 
global lower bound if /min > /min- Similarly, we set 
/3(A^2) /max if ./max < /3(A^2)- Finally, we update /max 
with the largest upper bound maxwieAA /3(A^) among the 
remaining boxes. The stopping criterion /max — /min < £ is 
checked before a new iteration is started. 

D. Final Algorithm 

The BRB algorithm that solves the general robust monotonic 
optimization problem in ( [T3] l is summarized in Table |n] 

TABLE II 

Algorithm 2: Branch-Reduce-and-Bound 



input A^o = [0, Smaxl/f^], accuracy e, line-search accuracy & 
set /min I /max based on the available prior Icnowledge 
set N = {Mo} and P{Mo) = /max 

while /max - /min > £ 

set yVlmax = argmax^gj^ P{M) 
fori = 1,2: _ _ 
create new box Mi using jl9) and set j3(MAas,mg \20) 
if /min reduce using Lemma l4| 

else: set Al; = end 
end 

Check feasibility of a.^ by solving {15) for 7^. = g^T^CSj j.) 

if feasible: 

Calculate bounds /min 1 /max in 2 using Lemma pi 

set /min^— niax(/min; ^/min) 

set /3(Aj2) = min(/3(X2), /max) 
else: set AI2 = end _ 

set AT = (AT \ A^max) U {Xl, A^2} 

set /max = max^g^ /3(A^) 
end 

return [/min i /max] and a feasible solution that achieved /min 



The BRB algorithm converges to the global optimum /^p, 
in the sense that an e-approximate interval /opt G [/min, /max]^ 
with /max — /min < is achieved in finitely many iterations 
for any e > 0. The line-search accuracy 5 is used in the 
bounding step to improve convergence speed, but there are 
no constraints on it to achieve convergence. 

Theorem 2. For any given accuracy e > 0, the BRB algorithm 
finds an interval [/min, /max] for the optimal value of ( [T3| l that 
satisfies /max — /min < £, in a finite number of iterations. The 
line-search accuracy 5 > can be selected arbitrarily. 

Proof: The convergence of the algorithm in Table |ll] can 
be studied as a standard branch-and-bound algorithm, treating 
the reduction step (which does not remove the solution) as part 
of the bounding step. In the appendix of pP] , two sufficient 
conditions are given for achieving an e-approximate solution 
in a finite number of iterations: 1) The bounding step truly 
calculates lower and upper bounds on the optimal value; 2) 
The difference /max — /min converges (uniformly) to zero. 
The first condition was proved in Lemma |5] while the second 
condition follows from the exhaustiveness of bisection and the 
Lipschitz continuity of / (i.e., ||b— a|| < constanti means that 
/(b) — /(a) < constant2). Finally, observe that bounding the 
performance in a box using only the lower and upper comers 
satisfies these conditions; thus, any 6 > can be used. ■ 
Although the algorithm converges, the worst-case conver- 
gence speed is exponential in the number of users Kr because 



the problem is NP-hard[^On the other hand, N and L are not 
affecting the convergence scaling of the BRB algorithm. The 
main computational complexity lies in the feasibility problem 
( [T5| ), which is solved individually in Step 11 and as part 
of Lemma |5] (a RFO problem) in Step 13. Under channel 
uncertainty, a convex formulation was given in Theorem [T] 
The dimension of the feasibihty problem can be reduced under 
perfect CSI and other special conditions (see Appendix [A] 
and Section IIIi, but as the BRB algorithm solves a long 



sequence of convex subproblems the total complexity makes 
it unsuitable for real-time applications. However, we show 
numerically in Section |VT] that the proposed algorithm has 
far better convergence behavior than the outer polyblock 
approximation in p4)-p7). 

Remark 1. The BRB algorithm in Table [11] is formulated to 
be applicable to any robust monotonic optimization problem. 
If only a certain type of user and system performance func- 
tions is of interest, this knowledge can be used to improve 
convergence. In particular, the bounding step should exploit 
any additional structure added to the problem. Instead of 
searching for feasible solutions on a straight line through the 
box (as the RFO does), the search can take place along any 
continuously elementwise-increasing curve between the lower 
and upper comer In addition, the feasibility problems can be 
simplified under perfect CSI and certain power constraints, as 



discussed in Section III-A The special case of weighted sum 
rate optimization and perfect CSI was recently considered in 
1 50 1 and | [5T) . Under a total power constraint, |50| formulated 
the search in a box as an approximate convex problem, 
which greatly improves the bounding step. With single-antenna 
transmitters, pT] showed that the feasibility problems can be 
solved by simply checking the spectral radius of a matrix. 
Finally, note that all the system model extensions discussed in 



Section III-C are possible for the BRB algorithm. 



V. Low-complexity Suboptimal Strategies 

In general multicell systems, it is easy to derive feasible 
suboptimal transmission strategies, but very difficult to evalu- 
ate their performance. The BRB algorithm in Section IV is 



suitable for computing benchmarks for such strategies. To 
evaluate the performance using some beamforming vectors 
(vi, . . . , vk^) and equalizing coefficients (ri, . . . , rx^), we 
need to know the robust MSB 7^, that each MS^ achieves 
(i.e., maxh^eWfc MSEfc < 7^). With notation 7^ = ^/jj:, the 
robust MSE 7^ is easily obtained by solving 



minimize 

^k>0,Xk>0 



subject to Afc >z Oat+x^ 



(26) 



+2 



which is a convex problem in 7^ and Afe. The matrix Afc is 
given in (Tt) using fk — r'^^^ and 7^ — y/jk- Next, we derive 
a low-complexity strategy based on interference constraints. 

'^Observe that the RFO problem avoids the exponential complexity by 
searching along one-dimensional curves in the ii'r -dimensional user space, 
while the BRB algorithm considers all dimensions. 
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A. Zero-Forcing and Interference-Constrained Beamforming 

A common way of simplifying transmission optimization is 
to constrain the co-user interference and to pretend that the 
channel estimates are the actual channels. This approach 
is powerful both from a computational perspective and in 
terms of performance in the high SNR regime |52J , but 
requires sufficiently many degrees of freedom (e.g., \Cj\ < 
Nj for all BSj). By pretending that the channel estimates 
are perfect, the "optimal" equalizing coefficient for MSfe is 
rfe = (vfDfCfh,)/(Es|hfCfeDfcV^.|2 and the 

"resulting" performance is MSEfc = {J2k^k I'^fe^CfcDj.vj.p + 

We consider ^^(MSEfe) = 1/MSEfc - 1 to achieve the 
following tractable problem formulation: 

maximize /(t^i, . . . , yy^J, 

subject to ' ^ > % Vfc, 

o-i + zk 



^ |hf CfcD^v^P < Vfc, 



(27) 



fc/fc 



^vfQ,Vfe<g, yi. 



The variables rjk were introduced to clarify that this problem 
can be solved optimally if /(•) is concave. If = Vfc, 
( pTj l gives the zero-forcing beamforming solution. Observe that 
Zk defines the maximal total interference that may be caused 
to MSfc, thus Zk > can be called interference-constrained 
beamforming. It could be suitable to select Zk > under CSI 
uncertainty since the actual interference cannot nulled anyway. 
The following lemma shows that ( |27] l can be solved as a 
convex optimization problem for any > Vfc. 

Lemma 6. The semi-definite relaxation of dZTll is 



(28) 



maximize fivi, ■ ■ ■ , Vk ), 

Vi?;0,...,VK,tO 

tr{Df Cfhfehf CfeDfeVfc} 
subject to --^ — k_kk — ^ > Vfc, 

ai + Zk 

J2 tr{Df Cf hfehf CD^Vfe} < Zk Vfc, 

J2tT{QiVk}<qi yi. 

k 

This problem is convex if /(•) is a (strictly increasing) concave 
function. It always has rank-one solutions = '^ki'^D^' 
where vj! solves the original problem in ( [27] l. 

Proof: The methodology in |61 and |53| can be used to 
show the existence of rank-one solutions. If an optimization 
procedure still delivers a high-rank solution V^., one can find 
by maximizing 5R{h^CfcDfeVfe} under the interference 
co_nstraints |hfCfcDfeVfc|2 < tr{Df Cf h^hf CfeDfcV*} 
Vfc 7^ fc and power constraints vf^QiVk < tr{Q;V^} ■ 
Based on Lemma |6] we can solve the zero-forcing problem 
for all concave system performance functions, for example, 
sum performance, proportional fairness, and max-min fairness. 
This can be viewed as an extension of the work in ||53l 



to multicell systems with arbitrary power constraints. Zero- 
forcing and interference-constrained beamforming are subop- 
timal, but their simplicity make them good low-complexity 
strategies and ( |27] i actually provides the optimal solution under 
perfect CSI if we happen to select the interference constraints 
optimally (see |54J for a systematic method to search over 
{zfc}). The performance under uncertain CSI is evaluated 
in Section 



VI 



Observe that zero-forcing and interference- 
constrained beamforming can be used to achieve an initial 
lower bound for the BRB algorithm and thereby speed up the 
convergence. 

VI. Numerical Examples 

In this section, the robust monotonic optimization frame- 
work is evaluated numerically, using the YALMIP toolbox of 



1 55 1 and the numerical convex optimization solver SDPT3 
from Q. First, the concept of robust performance regions 
and system performance functions are illustrated. Then, the 
performance of the zero-forcing approximation strategy in 
Section |V] is compared with the optimal solution, calculated 
using the proposed BRB algorithm. Finally, the computational 
complexity of solving the RFO problem is exemplified and 
the convergence of the BRB algorithm is compared with the 
outer polyblock approximation algorithm. 

While this section concentrates of illustrating different 
aspects of the proposed robust optimization framework, it 
is worth noting that our BRB algorithm was applied for 
performance benchmarking in realistic multicell systems with 
20 users in Q. 

A. Robust Performance Regions 

To illustrate the shape of the robust performance regions, 
we consider a simple network MIMO scenario with = 2 
users. The total number of transmit antennas is = 3, all 
channels are Rayleigh fading and spatially uncorrelated, and 
we use per-antenna constraints with qi = 10 (i.e., 10 dB). 
The average SNR E{\\hk\\l} / al is N for user 1 and N/A for 
user 2, creating an asymmetry that will highlight properties of 
different system performance functions. Spherical uncertainty 
sets Uk(t>-k, Bfc) are assumed with Bfc = v^Iat in (|6]l, where 
the parameter y/^ is the radius of the sphere. If one standard 
deviation of the channel estimation error is used as uncertainty 
set f37|, then ^ equals the estimation error variance. 

Fig. [5] shows the robust performance regions for a random 
channel realization and different ^. In Fig. 5(a)[ the inverse 
MSE is the user performance measure (i.e., ^^(MSEfe) = 
MSE^^ — 1 to make 5fe(l) = 0), but the figure axes show MSEs 
to enhance viewing. The guaranteed data rate ^^(MSEfc 



log2(MSEj, ^) is the user performance measure in Fig. 5(b) 



In both figures, the optimal system performance points are 
shown for the four measures exemplified in Section II-D sum 



performance, proportional fairness, harmonic mean, and max- 
min fairness. 

Robustness towards channel uncertainty clearly decreases 
the size of the performance regions, without affecting the 
general shape (in this scenario). The optimal points of the four 
system performance measures are all on the upper boundaries 
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(a) User Performance Measure: Inverse MSE, gfe(MSEfc) = MSEj, 
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(b) User Performance Measure: Data rate, gj;(MSEj;) = log2(MSE^^) 

Fig. 5. Robust performance regions with different squared radius § of 
the channel uncertainty sets. The user performance measure is either (a) the 
inverse MSE or (b) the guaranteed data rate. The optimal points with different 
system performance functions are shown. 



(confirming Lemma |2]i, but at quite different places. By 
introducing user weights in the system measures, the optimal 
points can be moved around on the upper boundary; in fact, 
the upper boundaries in Fig. |5] were generated by solving max- 
min fairness optimization problems for a large set of weights 
(i.e.. Algorithm 1 with different fairness-profiles). 



B. Evaluation of Zero-Forcing Beamforming 

Next, we evaluate the performance and robustness of the 
zero-forcing and interference-constrained beamforming ap- 
proaches in Section [V] (with maximal acceptable interference 
Zk = and > 0, respectively). The optimal solution, 
derived by the BRB algorithm, is used for benchmarking. We 
consider a scenario where Kt = 2 base stations (with Nj — 3 
antennas each) jointly serve Kr = 6 users. The channels hjk 
are modeled as uncorrelated Rayleigh fading. The users are 
located such that E{||hjfe||2}/cr^ is Nj for half the users and 
Nj/2 for the others, and vice versa for the other base station. 
Per-base station constraints are considered with the power qi 
as a parameter that will be varied. Proportional fairness of 
the user MSEs is used as system performance measure, thus 
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Fig. 6. Robust MSE with proportional fairness optimization (i.e., geometric 
mean). Zero-forcing and interference-constrained beamforming is compared 
with the optimal solution for different squared radius ^ of the channel 
uncertainty sets. 

the geometric mean MSE is minimized|^This is achieved by 
having .gfe(MSEfc) = MSE^^-l and /(g) = nj^fc + l)'/^'-. 

Fig. [6] shows the performance as a function of qi (the 
power per transmitter) and with spherical uncertainty sets with 
Bfc = v^Ijv- For small values on ^ (i.e., low uncertainty), the 
zero-forcing approach provides close-to-optimal performance 
in the high SNR regime. At low SNR and for larger values on 
^, zero-forcing is clearly suboptimal. Interference-constrained 
beamforming with Zk = {Kr ~ l)tr{BfeBf } {K,. - 1)N( 
achieves better performance in these regimes, but the differ- 
ence decreases with SNR which indicates that should be 
adapted to the SNR. In summary, zero-forcing is good at high 
SNR and robust to small channel uncertainties. By allowing 
interference Zk > 0, interference-constrained beamforming 
achieves better performance at low SNR and larger uncertain- 
ties. 

C. Computational Complexity of RFO 

Next, the computational complexity of the robust fairness- 
profile optimization problem is evaluated. We consider the 
same multicell scenario as in the previous subsection, but we 
fix the transmit power per base station at 10 dB and vary 
the number of users K,. G {4, 8, 12, 16, 20} and number of 
transmit antennas Ni = N2 e {4, 8, 12}. Recall that the 
RFO problem is solved by iterating the feasibility problem 
in ( [T5| l until a given line-search accuracy is achieved (10- 
15 iterations usually give a good accuracy, but it depends 
on the user performance functions). Therefore, Fig. [T] shows 
the average computational time for solving this subproblem. 
The simulation was performed at a standard PC running 
Linux/Ubuntu with an Intel Core2Duo Q8400 with 2.66 GHz 
using all four cores. 

Fig. [7] shows how the computational time increases with 
both the number of users and total number of antennas. It is 
clear that the complexity under channel uncertainty (i^ = 0.1) 
is several times higher and has a steeper slope than under 
perfect CSI. This was expected from the discussion in Section 



III-A since the convex representation of the MSE constraints 



Under perfect CSI, minimizing the geometric mean MSE is identical 
to maximizing the sum rate. This equivalence does not hold under channel 
uncertainty, but a lower bound on the sum rate is maximized |14[ . 
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has larger dimensions under uncertain CSI. Comparing all the 
scenarios, it is clear that the computational time spans from a 
fraction of a second to a fraction of a minute; thus, the RFO 
problem can be solved quite efficiently (even at a standard PC 
with a general-purpose numerical solver) and is applicable for 
future real-time applications. 

D. Convergence Evaluation of BRB Algorithm 

In this subsection, the convergence of the proposed BRB 
algorithm is compared with the outer polyblock approxima- 
tion algorithm in p6|, (27) and with a simple brute force 



10' 



approach! I As these algorithms are rather different, it is 
not meaningful to compare the number of iterations. Instead, 
we consider the performance as a function of the number 
of feasibility evaluations of the type in ([TSj. This convex 
subproblem is the main source of complexity in all the three 
approaches. 

We first compare the BRB algorithm with brute force and 
consider the same multicell scenario as in the previous subsec- 
tion. Fig.[8]shows the average number of feasibility evaluations 
(to achieve a relative error of 0.1) as a function of K,.- We 
let the number of antennas scale with the number of users as 
Nj = Kr/2 (we also have 6 = 1). Fig. [s] reveals that the BRB 
algorithm, despite the unavoidable exponential complexity in 
Kr, is much more efficient than a brute force approach. 
We also observe that channel uncertainty slightly reduces the 
number of evaluations, basically since the performance region 
becomes smaller 

Finally, we compare the BRB algorithm with the outer 
polyblock approximation algorithm of |26|, f2n\ and consider 
a similar scenario as in these papers. Thus, we have Kt = 2 
transmitters with TVi = A^2 = 3 antennas and perfect CSI. 
Each transmitter serves two unique users (i.e., Kr — 4), 
while coordinating interference to all users. The average SNR 
E{ II hjfe II i}/cr^ is Nj if user k e Vj and Nj/2 if k ^ V^. Each 
transmitter has its own total power constraint with qj — 10 
(i.e., 10 dB for single-user transmission) and the sum rate is 
chosen as performance measure. 

Fig. [9] shows the average relative deviation^ (over 250 
channel realizations) of the lower and upper bounds on the sum 
rate as a function of the number of feasibility evaluations. The 
BRB algorithm is used with a line-search accuracy S of either 
0.1 or 1, while (5 = 0.1 was used for the polyblock algorithm. 
Both algorithms quickly find feasible solutions within a few 
percent from the optimal value, but many evaluations are 
required to achieve a tight upper bound. However, the proposed 
BRB algorithm shows much faster convergence in both the 
lower and the upper bound; after 5000 feasibility evaluations, 
the polyblock algorithm has still not reached the accuracy 
that the BRB algorithm achieved with 1000 evaluations. This 
is consistent with observations in f22], where the difference 
is also claimed to increase with the number of users. Thus, 

'''with brute force we mean dividing tlie initial box of tlie BRB algoritlim 
into (very many) subboxes such that the difference between the lower and 
upper comer is less than e in each box. We solve one feasibility problem per 
box to find the optimum. 

'^If /opt is the optimal solution, the relative deviations of the lower and 
upper bound are (/min - /opi)//opt and (/max - /opt)//op(, respectively 
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Fig. 7. Example of the average computational time of solving the feasibility 
problem in \\5) with uncertain and perfect CSL Different number of users 
and total number of antennas A'^ = Nj are considered. 



in terms of achieving an ^-approximation on the optimal 
performance, the proposed BRB algorithm shows much faster 
convergence. For the BRB algorithm, 6 — 1 gives faster 
convergence than 5 = 0.1, indicating that having many 
iterations with loose bounds sometimes is more efficient than 
having few iterations with tight bounds. 

VII. Conclusions 

This paper presented an optimization framework for re- 
source allocation in multicell MISO downlink systems with 
general power constraints and robustness towards channel 
uncertainty. For any system performance measure that in- 
creases monotonically in each user's performance, the pro- 
posed branch-reduce-and-bound (BRB) algorithm solves the 
resource allocation to global optimality. In each iteration, a 
line-search is performed in the robust performance region — 
a quasi-convex fairness-profile optimization problem that can 
be solved efficiently. Since most multiuser resource allocation 
problems are non-convex and NP-hard, the BRB algorithm 
is mainly suitable for computing benchmarks due to high 
computational complexity. The benchmarking capability was 
illustrated numerically by comparing it with a simple zero- 
forcing approximation. In addition, the BRB algorithm was 
shown to provide far better convergence than the previously 
known outer polyblock approximation algorithm. 

Appendix A 
Improvements under perfect CSI 



Both 
Section 
Section 
(i.e., Uk 
can be 
ficients 

optimal 



the robust fairness-profile optimization problem in 
[Till and the monotonic optimization framework in 
IV|can be directly applied to the case of perfect CSI 
= {hj:}). However, the computational complexity 
reduced by observing that optimal equalizing coef- 
can be achieved by differentiation of MSE^ in (|7|: 
(vf Df Cf h,)/(Es |hf C,D,v,|2 + al). If this 



value is plugged into (|7]i, we achieve 



MSEfe = 



Es|hfCfcD,v,P + a2 



(29) 
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Fig. 8. Example of the average number of feasibility evaluations required 
to find the optimal solution (with a relative error of 0.1) with uncertain and 
perfect CSL 
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Fig. 9. Relative error of lower and upper bounds on the sum rate as a 
function of the number of feasibility evaluations. 

Based on this MSE expression, the feasibility problem in ( [TS] ) 
of Lemma [3] can be replaced by 

find vi,...,vk^ 

subject to ^ vf QiVfc < 



Ik 



Ik 



(30) 



This new feasibility problem is also convex, because the 
MSE constraints have been turned into second-order cone 
constraints (see ||5], ||7j, |[8| for details on this approach). 
However, (|30]l has fewer optimization variables which means 
lower computational complexity. Thus, (|30| should always be 
used instead of ( fT5| ) under perfect CSI. 

Appendix B 

Proof of Lemma |7| To prove that the set is normal, take 
X = (xi, . . . ,xk^) G Tl and assume that {rl}^Jl^ and 
{^k}k=i ^ feasible solution that attains this point. We want 
to show that any x' — {x[ , . . . , x'j^ ) G M.^'' with x' < x also 
belongs to TZ. To this end, we fix the beamforming vectors 
at {v^l^^j^ and search for equalizing coefficients {rx',k}k=i 
that gives MSE/j = 7^ for all fc, where 7^ = g^^{x'i^). 
For a given channel realization h^, denote the MSE in ^ 



as MSEA;(rfe, hfe). Observe that MSEfc(rfe, h^) is a second- 
order polynomial in rk that has a unique minimum and then 
approaches infinity continuously as 00. Thus, we can 

solve the second-order equation MSEfe(rfe, h^) = 7^. to derive 
the largest root 



ak 



+ V4-i^-l'k)bk 



where 



3?(hfCfeDfeVfe) and hk 



(31) 



,h^Cfe[DiVi 
be real-valued, 
increases with 



D 



+ af.. This solution will 
because it is real-valued for x' = x and 
7^.. Since we consider the largest root. 



MSEfe(rfe,hfc) > for all > <(hfc). By selecting 



^k = min rj,(hfe) Vfc 



(32) 



we can make sure that 5fc(MSEfc) 



and thus that x' e TZ. 



Next, we prove that 7?. is a compact set. First, observe that 
the set of feasible beamforming vectors, V, in ( [T2) l is compact. 
Next, observe that it is sufficient to search for equalizing 
coefficients rk in the compact set £k = [0, l/cTfc], since greater 
values make the noise part of the MSE in (|7]) larger than one 
(and thus, MSE^ > 1). The MSEs are continuous functions of 
the beamforming vectors and equalizing coefficients, and the 
performance functions gk{MSEk) are continuous by definition. 
Therefore, (7fc(min(maxhj.GWfc MSE^, 1)) is continuous for any 
compact set U^. Finally, we invoke p6] Theorem 4.14], which 
says that the continuous mapping of a compact set is also a 
compact set. Since TZ is the image of a continuous mapping 
from V and £k, the robust performance region is compact. 

Proof of Theorem |7| The proof is based on the following 
well-known result in robust worst-case optimization theory: 

Lemma 7. Given A, P, Q, with A = A^, the expression 



A ^ P^ZQ 
holds if and only if 

3A e H 



Q"Z"V VZ:||Z||2<£' (33) 



s.t. 



A-AQ^Q -qP" 



>- 0. 



(34) 



Proof: The proof is given in | [57] Proposition 2]. ■ 
Lemma I2] can be used to reformulate each MSE constraint 



max MSEfc < 7^ 



(35) 



in ( fTSj ) as a semi-definite constraint. First, we replace with 
hfc + B^efe in the MSE expression of (|7]i. Next, we apply 
Schur complement lemma |58, Theorem 1.12] to rewrite ( |35| ) 
as 



VWk 

a-k 



hfCfeV- 



-1„T 



efc 











e?BfC.V0' 



'k - 





V^C^BkEk 



*k 

Ok 




hO Vgfc : llelU < 1. 



(36) 

Finally, we apply Lemma |7] with A being the first matrix in 
(|36]), P = [0 Bf CfcV 0], Q = [-1 0], Z = gfe, and 
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Q — 1. The obtained reformulation of ([36| is the constraint in 
( fTSj l. By optimizing over fk = r'jl^ instead of r^, we observe 
that this constraint is Hnear in Vi, . . . ,Vfc^,fk- Thus, the 
reformulated problem is convex. Further details are available 
in similar proofs (under different system assumptions) in for 
instance |11|-|14|. 

Proof of Lemma |4[ First, consider the reduction of the box 
from [ai,b;] to [aj,b;] (i.e., from below). If the boxes are 
identical, no solutions are lost and we are finished. Otherwise, 
a; < a'l with strict inequality in at least one element. For 
elements with strict inequality we have i^k < 1, while i^k — ^ 
holds for all other elements. There exist g G [a/,b;] such that 
g ^ [aj,b/]. For any such g there is a dimension k such that 
gk < a'lj. and Vk < 1. Thus, g < hi-D{bi,k-ai,k)ek for some 
D with J^fe < I' < 1. For the system performance function, we 
have 

/(g) < /(bi - Hbi.k - ai,k)Gk) ^^^^ 

< fi^i - Vk{hi^k - ai,k)ek) ^ /min- 

The strict inequality follows since I'k is selected to be the 
largest value that gives equality in the set defined in ( |23] l. 
From ( |37| ) it is clear that any g removed in the reduction (from 
below) will have a function value strictly below /min- The 
reduction from above is proved analogously. Finally, observe 
that [aj,bj] C [a;,b;] since each element in aj and bj are 
calculated as convex combinations of a; and b;. 

Proof of Lemma^ The line-search procedure in Algorithm 
1 finds the best feasible solution (with accuracy S) on the line 
segment between a and b. Thus, a + cc/^q is feasible and 
can be used for a lower bound on the performance. Similarly, 
n is either on the upper boundary or infeasible. Since TZ is 
normal, there are no feasible points x e with x > n. The 
corner points where all element but one are larger than in n 
are b — (6^ — nk)e.k for fc = 1, . . . , Kr- These can be used to 
calculate an upper bound on the performance. 
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